Enriching SerbianWordNet and Electronic Dictionaries with Terms from the Culinary Domain
نویسندگان
چکیده
In this paper we present three lexical resources for Serbian that are crucial for the development of applications in the culinary domain based on natural language processing. The first two of them — Serbian WordNet and morphological edictionaries — have already been in development for some time, while the third one – a corpus of culinary recipes -– has been developed specifically for this purpose. In this paper, we present how we use each of these resources to correct and enlarge the other two. We use various automatic procedures, but manually check all the results.
منابع مشابه
Medical Students’ Perception of Using Electronic Learning Tools in an ESP Program
Given the burgeoning interest in the use of technology and electronic tools for educational purposes among students, this study set out with the purpose of investigating medical students’ perception of using e-learning tools and applications in an English for Specific Purposes (ESP) program at an Iranian medical university. The study also aimed to discover the extent to which the students...
متن کاملOn multiword lexical units and their role in maritime dictionaries
Multi-word lexical units are a typical feature of specialized dictionaries, in particular monolingual and bilingual maritime dictionaries. The paper studies the concept of the multi-word lexical unit and considers the similarities and differences of their selection and presentation in monolingual and bilingual maritime dictionaries. The work analyses such issues as the classification of multi-w...
متن کاملEquivalence in Technical Texts: The Case of Accounting Terms in English-Persian Dictionaries
Translating accounting documents, in general, and accounting terminology, in particular, is not a simple task, especially when the new terms keep created in pace with accounting developments. This study was carried out to find the most common and preferable ways to translate accounting terms from English into Persian. Also, an attempt was made to identify the frequently used patterns of word-fo...
متن کاملAutomatic Annotation of Medical Records in Spanish with Disease, Drug and Substance Names
This paper presents an annotation tool that detects entities in the biomedical domain. By enriching the lexica of the Freeling analyzer with bio-medical terms extracted from dictionaries and ontologies as SNOMED CT, the system is able to automatically detect medical terms in texts. An evaluation has been performed against a manually tagged corpus focusing on entities referring to pharmaceutical...
متن کاملEnriching Slovene WordNet with domain-specific terms
The paper describes an innovative approach to expanding the domain coverage of wordnet by exploiting multiple resources. In the experiment described here we are using a large monolingual Slovene corpus of texts from the domain of informatics to harvest terminology from, and a parallel English-Slovene corpus and an online dictionary as bilingual resources to facilitate the mapping of terms to th...
متن کامل